CzAccent - Simple Tool for Restoring Accents in Czech Texts

نویسنده

  • Pavel Rychlý
چکیده

There are many Czech text written without any accents. The paper describes a tool for fully automatic restoration of Czech accents. The system is based on a simple approach of big lexicon. The resulting accuracy of the system evaluated on large Czech corpora is quite high. The system is in regular use by hundreds of users from around the whole world.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Language Support A Simple Technique for Typesetting Hebrew with Vowel Points

This paper describes a simple mechanism for typesetting Hebrew with vowel points. Hebrew uses a large set of accents that represent vowels, consonant modifiers, and cantillation instructions. These accents are placed above, below, or inside letters; a single letter can carry several accents. The solution that we describe, which is designed for PostScript [2] output devices, leaves the placement...

متن کامل

Positional variability of pitch accents in Czech

An analysis of prenuclear accents in read speech is carried out with the aim of finding instances of regularity in their distribution. Significant differences are identified with respect to position within the phrase and phrase length, some of which are correlated with declination and pitch span narrowing. Only a weak interaction is found between nuclear and prenuclear pitch accents. No tendenc...

متن کامل

Prosodic Phrases and Semantic Accents in Speech Corpus for Czech TTS Synthesis

We describe a statistical method for assignment of prosodic phrases and semantic accents in read speech data. The method is based on statistical evaluation of listening test data by a maximum-likelihood approach with parameters estimated by an EM algorithm. We also present linguistically relevant quantitative results about the prosodic phrase and semantic accent distribution in 250 Czech

متن کامل

Pitch Accents, Boundary Tones and Contours: Automatic Learning of Czech Intonation

The present paper examines three methods of intonational stylization in the Czech language: a sequence of pitch accents, a sequence of boundary tones, and a sequence of contours. The efficiency of these methods was compared by means of a neural network which predicted the f0 curve from each of the three types of input, with subsequent perceptual assessment. The results show that Czech intonatio...

متن کامل

Acoustic analysis of Czech stress: intonation, duration and intensity revisited

By examining acoustic marks of Czech stress, this paper attempts to provide an answer to the question of whether or not perceived accents in the Czech language have an objective existence. A neural network is used to predict the position of accents without lexical information. Three parameters (intonation, duration and intensity) are considered individually, in pairs and altogether. Fundamental...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012